Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 487 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 42.9 KiB |
| Average record size in memory | 90.3 B |
Variable types
| NUM | 10 |
|---|---|
| BOOL | 2 |
| CAT | 1 |
Reproduction
| Analysis started | 2020-06-24 03:12:59.824444 |
|---|---|
| Analysis finished | 2020-06-24 03:14:03.111559 |
| Duration | 1 minute and 3.29 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Gender_Male is highly correlated with Gender_Female | High correlation |
Gender_Female is highly correlated with Gender_Male | High correlation |
df_index has unique values | Unique |
| Distinct count | 487 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 253.3100616016427 |
|---|---|
| Minimum | 0 |
| Maximum | 499 |
| Zeros | 1 |
| Zeros (%) | 0.2% |
| Memory size | 3.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 25.6 |
| Q1 | 128.5 |
| median | 256 |
| Q3 | 377.5 |
| 95-th percentile | 474.7 |
| Maximum | 499 |
| Range | 499 |
| Interquartile range (IQR) | 249 |
Descriptive statistics
| Standard deviation | 144.1515705 |
|---|---|
| Coefficient of variation (CV) | 0.5690716332 |
| Kurtosis | -1.189132483 |
| Mean | 253.3100616 |
| Median Absolute Deviation (MAD) | 125 |
| Skewness | -0.0398491358 |
| Sum | 123362 |
| Variance | 20779.67527 |
| Value | Count | Frequency (%) | |
| 499 | 1 | 0.2% | |
| 193 | 1 | 0.2% | |
| 163 | 1 | 0.2% | |
| 165 | 1 | 0.2% | |
| 166 | 1 | 0.2% | |
| 167 | 1 | 0.2% | |
| 168 | 1 | 0.2% | |
| 169 | 1 | 0.2% | |
| 170 | 1 | 0.2% | |
| 171 | 1 | 0.2% | |
| Other values (477) | 477 | 97.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | 0.2% | |
| 1 | 1 | 0.2% | |
| 2 | 1 | 0.2% | |
| 3 | 1 | 0.2% | |
| 4 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 499 | 1 | 0.2% | |
| 498 | 1 | 0.2% | |
| 497 | 1 | 0.2% | |
| 496 | 1 | 0.2% | |
| 495 | 1 | 0.2% |
Age
Real number (ℝ≥0)
| Distinct count | 70 |
|---|---|
| Unique (%) | 14.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.70225872689939 |
|---|---|
| Minimum | 4 |
| Maximum | 85 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.8 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 32.5 |
| median | 45 |
| Q3 | 58 |
| 95-th percentile | 72 |
| Maximum | 85 |
| Range | 81 |
| Interquartile range (IQR) | 25.5 |
Descriptive statistics
| Standard deviation | 16.60357363 |
|---|---|
| Coefficient of variation (CV) | 0.3714258319 |
| Kurtosis | -0.6927595706 |
| Mean | 44.70225873 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.05308916481 |
| Sum | 21770 |
| Variance | 275.6786574 |
| Value | Count | Frequency (%) | |
| 60 | 31 | 6.4% | |
| 48 | 20 | 4.1% | |
| 45 | 19 | 3.9% | |
| 50 | 19 | 3.9% | |
| 38 | 18 | 3.7% | |
| 42 | 16 | 3.3% | |
| 65 | 16 | 3.3% | |
| 55 | 15 | 3.1% | |
| 33 | 15 | 3.1% | |
| 75 | 14 | 2.9% | |
| Other values (60) | 304 | 62.4% |
| Value | Count | Frequency (%) | |
| 4 | 2 | 0.4% | |
| 6 | 1 | 0.2% | |
| 7 | 2 | 0.4% | |
| 8 | 1 | 0.2% | |
| 11 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 85 | 1 | 0.2% | |
| 84 | 1 | 0.2% | |
| 78 | 1 | 0.2% | |
| 75 | 14 | 2.9% | |
| 74 | 4 | 0.8% |
Total_Bilirubin
Real number (ℝ≥0)
| Distinct count | 87 |
|---|---|
| Unique (%) | 17.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.6121149897330596 |
|---|---|
| Minimum | 0.4 |
| Maximum | 75.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.8 KiB |
Quantile statistics
| Minimum | 0.4 |
|---|---|
| 5-th percentile | 0.6 |
| Q1 | 0.8 |
| median | 0.9 |
| Q3 | 2.15 |
| 95-th percentile | 10.09 |
| Maximum | 75 |
| Range | 74.6 |
| Interquartile range (IQR) | 1.35 |
Descriptive statistics
| Standard deviation | 5.173507721 |
|---|---|
| Coefficient of variation (CV) | 1.98058192 |
| Kurtosis | 83.7488455 |
| Mean | 2.61211499 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | 7.389036994 |
| Sum | 1272.1 |
| Variance | 26.76518214 |
| Value | Count | Frequency (%) | |
| 0.8 | 80 | 16.4% | |
| 0.7 | 71 | 14.6% | |
| 0.9 | 51 | 10.5% | |
| 0.6 | 40 | 8.2% | |
| 1 | 23 | 4.7% | |
| 1.1 | 17 | 3.5% | |
| 1.8 | 11 | 2.3% | |
| 1.4 | 11 | 2.3% | |
| 1.3 | 11 | 2.3% | |
| 1.7 | 10 | 2.1% | |
| Other values (77) | 162 | 33.3% |
| Value | Count | Frequency (%) | |
| 0.4 | 1 | 0.2% | |
| 0.5 | 4 | 0.8% | |
| 0.6 | 40 | 8.2% | |
| 0.7 | 71 | 14.6% | |
| 0.8 | 80 | 16.4% |
| Value | Count | Frequency (%) | |
| 75 | 1 | 0.2% | |
| 30.5 | 1 | 0.2% | |
| 27.2 | 1 | 0.2% | |
| 23.3 | 1 | 0.2% | |
| 23.2 | 1 | 0.2% |
Direct_Bilirubin
Real number (ℝ≥0)
| Distinct count | 61 |
|---|---|
| Unique (%) | 12.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1207392197125257 |
|---|---|
| Minimum | 0.1 |
| Maximum | 14.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.8 KiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 0.1 |
| Q1 | 0.2 |
| median | 0.3 |
| Q3 | 1 |
| 95-th percentile | 4.97 |
| Maximum | 14.2 |
| Range | 14.1 |
| Interquartile range (IQR) | 0.8 |
Descriptive statistics
| Standard deviation | 2.084303741 |
|---|---|
| Coefficient of variation (CV) | 1.85975801 |
| Kurtosis | 14.35144273 |
| Mean | 1.12073922 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 3.589204907 |
| Sum | 545.8 |
| Variance | 4.344322086 |
| Value | Count | Frequency (%) | |
| 0.2 | 176 | 36.1% | |
| 0.1 | 54 | 11.1% | |
| 0.3 | 42 | 8.6% | |
| 0.8 | 19 | 3.9% | |
| 0.4 | 18 | 3.7% | |
| 0.5 | 17 | 3.5% | |
| 0.6 | 15 | 3.1% | |
| 1.3 | 12 | 2.5% | |
| 1 | 12 | 2.5% | |
| 0.7 | 11 | 2.3% | |
| Other values (51) | 111 | 22.8% |
| Value | Count | Frequency (%) | |
| 0.1 | 54 | 11.1% | |
| 0.2 | 176 | 36.1% | |
| 0.3 | 42 | 8.6% | |
| 0.4 | 18 | 3.7% | |
| 0.5 | 17 | 3.5% |
| Value | Count | Frequency (%) | |
| 14.2 | 1 | 0.2% | |
| 12.8 | 1 | 0.2% | |
| 12.6 | 2 | 0.4% | |
| 11.8 | 1 | 0.2% | |
| 11.4 | 1 | 0.2% |
Alkaline_Phosphotase
Real number (ℝ≥0)
| Distinct count | 235 |
|---|---|
| Unique (%) | 48.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 297.90143737166323 |
|---|---|
| Minimum | 63 |
| Maximum | 2110 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.8 KiB |
Quantile statistics
| Minimum | 63 |
|---|---|
| 5-th percentile | 140 |
| Q1 | 175 |
| median | 205 |
| Q3 | 298 |
| 95-th percentile | 750 |
| Maximum | 2110 |
| Range | 2047 |
| Interquartile range (IQR) | 123 |
Descriptive statistics
| Standard deviation | 260.4012702 |
|---|---|
| Coefficient of variation (CV) | 0.8741188782 |
| Kurtosis | 15.4112214 |
| Mean | 297.9014374 |
| Median Absolute Deviation (MAD) | 47 |
| Skewness | 3.566574105 |
| Sum | 145078 |
| Variance | 67808.82154 |
| Value | Count | Frequency (%) | |
| 198 | 10 | 2.1% | |
| 182 | 9 | 1.8% | |
| 190 | 9 | 1.8% | |
| 195 | 9 | 1.8% | |
| 145 | 8 | 1.6% | |
| 180 | 8 | 1.6% | |
| 215 | 8 | 1.6% | |
| 298 | 8 | 1.6% | |
| 202 | 7 | 1.4% | |
| 282 | 7 | 1.4% | |
| Other values (225) | 404 | 83.0% |
| Value | Count | Frequency (%) | |
| 63 | 1 | 0.2% | |
| 75 | 1 | 0.2% | |
| 90 | 1 | 0.2% | |
| 92 | 2 | 0.4% | |
| 100 | 2 | 0.4% |
| Value | Count | Frequency (%) | |
| 2110 | 1 | 0.2% | |
| 1896 | 1 | 0.2% | |
| 1750 | 1 | 0.2% | |
| 1630 | 1 | 0.2% | |
| 1620 | 1 | 0.2% |
Alamine_Aminotransferase
Real number (ℝ≥0)
| Distinct count | 141 |
|---|---|
| Unique (%) | 29.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81.63655030800821 |
|---|---|
| Minimum | 10 |
| Maximum | 2000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.8 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 23 |
| median | 33 |
| Q3 | 59 |
| 95-th percentile | 231.4 |
| Maximum | 2000 |
| Range | 1990 |
| Interquartile range (IQR) | 36 |
Descriptive statistics
| Standard deviation | 193.4211628 |
|---|---|
| Coefficient of variation (CV) | 2.369296132 |
| Kurtosis | 47.30139426 |
| Mean | 81.63655031 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 6.412683064 |
| Sum | 39757 |
| Variance | 37411.74623 |
| Value | Count | Frequency (%) | |
| 25 | 22 | 4.5% | |
| 20 | 18 | 3.7% | |
| 18 | 17 | 3.5% | |
| 28 | 15 | 3.1% | |
| 30 | 15 | 3.1% | |
| 22 | 14 | 2.9% | |
| 21 | 14 | 2.9% | |
| 15 | 13 | 2.7% | |
| 36 | 11 | 2.3% | |
| 24 | 11 | 2.3% | |
| Other values (131) | 337 | 69.2% |
| Value | Count | Frequency (%) | |
| 10 | 4 | 0.8% | |
| 11 | 2 | 0.4% | |
| 12 | 7 | 1.4% | |
| 13 | 4 | 0.8% | |
| 14 | 6 | 1.2% |
| Value | Count | Frequency (%) | |
| 2000 | 1 | 0.2% | |
| 1680 | 1 | 0.2% | |
| 1630 | 1 | 0.2% | |
| 1350 | 1 | 0.2% | |
| 1250 | 2 | 0.4% |
Aspartate_Aminotransferase
Real number (ℝ≥0)
| Distinct count | 154 |
|---|---|
| Unique (%) | 31.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 108.17043121149898 |
|---|---|
| Minimum | 10 |
| Maximum | 4929 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.8 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 24 |
| median | 40 |
| Q3 | 78 |
| 95-th percentile | 400.7 |
| Maximum | 4929 |
| Range | 4919 |
| Interquartile range (IQR) | 54 |
Descriptive statistics
| Standard deviation | 309.7203068 |
|---|---|
| Coefficient of variation (CV) | 2.863262199 |
| Kurtosis | 136.9981103 |
| Mean | 108.1704312 |
| Median Absolute Deviation (MAD) | 19 |
| Skewness | 10.22427137 |
| Sum | 52679 |
| Variance | 95926.66842 |
| Value | Count | Frequency (%) | |
| 23 | 15 | 3.1% | |
| 20 | 14 | 2.9% | |
| 21 | 13 | 2.7% | |
| 30 | 12 | 2.5% | |
| 25 | 12 | 2.5% | |
| 22 | 12 | 2.5% | |
| 29 | 11 | 2.3% | |
| 28 | 11 | 2.3% | |
| 19 | 10 | 2.1% | |
| 40 | 10 | 2.1% | |
| Other values (144) | 367 | 75.4% |
| Value | Count | Frequency (%) | |
| 10 | 1 | 0.2% | |
| 11 | 2 | 0.4% | |
| 12 | 5 | 1.0% | |
| 13 | 3 | 0.6% | |
| 14 | 8 | 1.6% |
| Value | Count | Frequency (%) | |
| 4929 | 1 | 0.2% | |
| 2946 | 1 | 0.2% | |
| 1600 | 1 | 0.2% | |
| 1500 | 1 | 0.2% | |
| 1050 | 2 | 0.4% |
Total_Protiens
Real number (ℝ≥0)
| Distinct count | 57 |
|---|---|
| Unique (%) | 11.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.459137577002054 |
|---|---|
| Minimum | 2.7 |
| Maximum | 9.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.8 KiB |
Quantile statistics
| Minimum | 2.7 |
|---|---|
| 5-th percentile | 4.6 |
| Q1 | 5.7 |
| median | 6.5 |
| Q3 | 7.2 |
| 95-th percentile | 8.1 |
| Maximum | 9.6 |
| Range | 6.9 |
| Interquartile range (IQR) | 1.5 |
Descriptive statistics
| Standard deviation | 1.092959924 |
|---|---|
| Coefficient of variation (CV) | 0.1692114327 |
| Kurtosis | 0.2505812934 |
| Mean | 6.459137577 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | -0.3276127637 |
| Sum | 3145.6 |
| Variance | 1.194561395 |
| Value | Count | Frequency (%) | |
| 7 | 27 | 5.5% | |
| 6 | 25 | 5.1% | |
| 6.8 | 24 | 4.9% | |
| 6.2 | 21 | 4.3% | |
| 7.1 | 18 | 3.7% | |
| 6.9 | 18 | 3.7% | |
| 8 | 17 | 3.5% | |
| 7.2 | 17 | 3.5% | |
| 7.3 | 16 | 3.3% | |
| 6.1 | 16 | 3.3% | |
| Other values (47) | 288 | 59.1% |
| Value | Count | Frequency (%) | |
| 2.7 | 1 | 0.2% | |
| 2.8 | 1 | 0.2% | |
| 3 | 1 | 0.2% | |
| 3.6 | 2 | 0.4% | |
| 3.7 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 9.6 | 1 | 0.2% | |
| 9.5 | 1 | 0.2% | |
| 8.9 | 1 | 0.2% | |
| 8.7 | 1 | 0.2% | |
| 8.6 | 2 | 0.4% |
Albumin
Real number (ℝ≥0)
| Distinct count | 39 |
|---|---|
| Unique (%) | 8.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.1778234086242296 |
|---|---|
| Minimum | 0.9 |
| Maximum | 5.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.8 KiB |
Quantile statistics
| Minimum | 0.9 |
|---|---|
| 5-th percentile | 1.8 |
| Q1 | 2.6 |
| median | 3.2 |
| Q3 | 3.8 |
| 95-th percentile | 4.4 |
| Maximum | 5.5 |
| Range | 4.6 |
| Interquartile range (IQR) | 1.2 |
Descriptive statistics
| Standard deviation | 0.8010544656 |
|---|---|
| Coefficient of variation (CV) | 0.2520764569 |
| Kurtosis | -0.3609524225 |
| Mean | 3.177823409 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | -0.1056334395 |
| Sum | 1547.6 |
| Variance | 0.6416882568 |
| Value | Count | Frequency (%) | |
| 3 | 34 | 7.0% | |
| 4 | 29 | 6.0% | |
| 2.9 | 27 | 5.5% | |
| 3.1 | 24 | 4.9% | |
| 3.9 | 23 | 4.7% | |
| 3.2 | 22 | 4.5% | |
| 3.5 | 21 | 4.3% | |
| 2.5 | 20 | 4.1% | |
| 3.3 | 20 | 4.1% | |
| 3.7 | 18 | 3.7% | |
| Other values (29) | 249 | 51.1% |
| Value | Count | Frequency (%) | |
| 0.9 | 2 | 0.4% | |
| 1.4 | 3 | 0.6% | |
| 1.5 | 3 | 0.6% | |
| 1.6 | 7 | 1.4% | |
| 1.7 | 2 | 0.4% |
| Value | Count | Frequency (%) | |
| 5.5 | 2 | 0.4% | |
| 5 | 1 | 0.2% | |
| 4.9 | 3 | 0.6% | |
| 4.8 | 2 | 0.4% | |
| 4.7 | 3 | 0.6% |
Albumin_and_Globulin_Ratio
Real number (ℝ≥0)
| Distinct count | 63 |
|---|---|
| Unique (%) | 12.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9626899383983574 |
|---|---|
| Minimum | 0.3 |
| Maximum | 1.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.8 KiB |
Quantile statistics
| Minimum | 0.3 |
|---|---|
| 5-th percentile | 0.5 |
| Q1 | 0.8 |
| median | 1 |
| Q3 | 1.1 |
| 95-th percentile | 1.5 |
| Maximum | 1.9 |
| Range | 1.6 |
| Interquartile range (IQR) | 0.3 |
Descriptive statistics
| Standard deviation | 0.2923777089 |
|---|---|
| Coefficient of variation (CV) | 0.3037091147 |
| Kurtosis | 0.2602027705 |
| Mean | 0.9626899384 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 0.4464170371 |
| Sum | 468.83 |
| Variance | 0.08548472465 |
| Value | Count | Frequency (%) | |
| 1 | 95 | 19.5% | |
| 0.8 | 54 | 11.1% | |
| 0.9 | 47 | 9.7% | |
| 0.7 | 41 | 8.4% | |
| 1.1 | 38 | 7.8% | |
| 1.2 | 31 | 6.4% | |
| 0.6 | 27 | 5.5% | |
| 1.3 | 25 | 5.1% | |
| 0.5 | 19 | 3.9% | |
| 1.4 | 17 | 3.5% | |
| Other values (53) | 93 | 19.1% |
| Value | Count | Frequency (%) | |
| 0.3 | 2 | 0.4% | |
| 0.35 | 1 | 0.2% | |
| 0.4 | 7 | 1.4% | |
| 0.45 | 1 | 0.2% | |
| 0.47 | 2 | 0.4% |
| Value | Count | Frequency (%) | |
| 1.9 | 1 | 0.2% | |
| 1.85 | 2 | 0.4% | |
| 1.8 | 3 | 0.6% | |
| 1.72 | 1 | 0.2% | |
| 1.7 | 4 | 0.8% |
Liver_Problem
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.8 KiB |
| 1 | |
|---|---|
| 2 |
| Value | Count | Frequency (%) | |
| 1 | 340 | 69.8% | |
| 2 | 147 | 30.2% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 487.0 B |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 361 | 74.1% | |
| 1 | 126 | 25.9% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | Age | Total_Bilirubin | Direct_Bilirubin | Alkaline_Phosphotase | Alamine_Aminotransferase | Aspartate_Aminotransferase | Total_Protiens | Albumin | Albumin_and_Globulin_Ratio | Liver_Problem | Gender_Female | Gender_Male | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 65 | 0.7 | 0.1 | 187 | 16 | 18 | 6.8 | 3.3 | 0.90 | 1 | 1 | 0 |
| 1 | 1 | 62 | 10.9 | 5.5 | 699 | 64 | 100 | 7.5 | 3.2 | 0.74 | 1 | 0 | 1 |
| 2 | 2 | 62 | 7.3 | 4.1 | 490 | 60 | 68 | 7.0 | 3.3 | 0.89 | 1 | 0 | 1 |
| 3 | 3 | 58 | 1.0 | 0.4 | 182 | 14 | 20 | 6.8 | 3.4 | 1.00 | 1 | 0 | 1 |
| 4 | 4 | 72 | 3.9 | 2.0 | 195 | 27 | 59 | 7.3 | 2.4 | 0.40 | 1 | 0 | 1 |
| 5 | 5 | 46 | 1.8 | 0.7 | 208 | 19 | 14 | 7.6 | 4.4 | 1.30 | 1 | 0 | 1 |
| 6 | 6 | 26 | 0.9 | 0.2 | 154 | 16 | 12 | 7.0 | 3.5 | 1.00 | 1 | 1 | 0 |
| 7 | 7 | 29 | 0.9 | 0.3 | 202 | 14 | 11 | 6.7 | 3.6 | 1.10 | 1 | 1 | 0 |
| 8 | 8 | 17 | 0.9 | 0.3 | 202 | 22 | 19 | 7.4 | 4.1 | 1.20 | 2 | 0 | 1 |
| 9 | 9 | 55 | 0.7 | 0.2 | 290 | 53 | 58 | 6.8 | 3.4 | 1.00 | 1 | 0 | 1 |
Last rows
| df_index | Age | Total_Bilirubin | Direct_Bilirubin | Alkaline_Phosphotase | Alamine_Aminotransferase | Aspartate_Aminotransferase | Total_Protiens | Albumin | Albumin_and_Globulin_Ratio | Liver_Problem | Gender_Female | Gender_Male | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 477 | 490 | 53 | 0.8 | 0.2 | 193 | 96 | 57 | 6.7 | 3.6 | 1.16 | 1 | 1 | 0 |
| 478 | 491 | 27 | 1.0 | 0.3 | 180 | 56 | 111 | 6.8 | 3.9 | 1.85 | 2 | 0 | 1 |
| 479 | 492 | 35 | 1.0 | 0.3 | 805 | 133 | 103 | 7.9 | 3.3 | 0.70 | 1 | 1 | 0 |
| 480 | 493 | 65 | 0.7 | 0.2 | 265 | 30 | 28 | 5.2 | 1.8 | 0.52 | 2 | 0 | 1 |
| 481 | 494 | 25 | 0.7 | 0.2 | 185 | 196 | 401 | 6.5 | 3.9 | 1.50 | 1 | 0 | 1 |
| 482 | 495 | 32 | 0.7 | 0.2 | 165 | 31 | 29 | 6.1 | 3.0 | 0.96 | 2 | 0 | 1 |
| 483 | 496 | 24 | 1.0 | 0.2 | 189 | 52 | 31 | 8.0 | 4.8 | 1.50 | 1 | 0 | 1 |
| 484 | 497 | 67 | 2.2 | 1.1 | 198 | 42 | 39 | 7.2 | 3.0 | 0.70 | 1 | 0 | 1 |
| 485 | 498 | 68 | 1.8 | 0.5 | 151 | 18 | 22 | 6.5 | 4.0 | 1.60 | 1 | 0 | 1 |
| 486 | 499 | 55 | 3.6 | 1.6 | 349 | 40 | 70 | 7.2 | 2.9 | 0.60 | 1 | 0 | 1 |